A Study of Various Clustering Algorithms on Retail Sales Data
نویسندگان
چکیده
Data mining is the process of extraction of Hidden knowledge from the databases. Clustering is one the important functionality of the data mining Clustering is an adaptive methodology in which objects are grouped together, based on the principle of optimizing the inside class similarity and minimizing the class-class similarity. Various clustering algorithms have been developed resulting in a better performance on datasets for clustering. The paper discusses the four major clustering algorithms: KMeans, Density based, Filtered, Farthest First clustering algorithm and comparing the performances of these principle clustering algorithms on the aspect of correctly class wise cluster building ability of algorithm .The results are tested on datasets of retail sales using WEKA interface and compute the correctly cluster building instances in proportion with incorrectly formed cluster. A comparison of these four algorithms is given on the basis of percentage of incorrectly classified instances.
منابع مشابه
Use of the Improved Frog-Leaping Algorithm in Data Clustering
Clustering is one of the known techniques in the field of data mining where data with similar properties is within the set of categories. K-means algorithm is one the simplest clustering algorithms which have disadvantages sensitive to initial values of the clusters and converging to the local optimum. In recent years, several algorithms are provided based on evolutionary algorithms for cluster...
متن کاملA Comparative Study of Some Clustering Algorithms on Shape Data
Recently, some statistical studies have been done using the shape data. One of these studies is clustering shape data, which is the main topic of this paper. We are going to study some clustering algorithms on shape data and then introduce the best algorithm based on accuracy, speed, and scalability criteria. In addition, we propose a method for representing the shape data that facilitates and ...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملRetail Market analysis in targeting sales based on Consumer Behaviour using Fuzzy Clustering - A Rule Based Mode
Product Bundling and offering products to customers is of critical importance in retail marketing. In general, product bundling and offering products to customers involves two main issues, namely identification of product taste according to demography and product evaluation and selection to increase sales. The former helps to identify, analyze and understand customer needs according to the demo...
متن کاملGenerating Customer Profiles for Retail Stores Using Clustering Techniques
The retail industry collects huge amounts of data on sales, customer buying history, goods transportation, consumption, and service. With increased availability and ease of use of modern computing technology and e-commerce, the availability and popularity of such businesses has grown rapidly. Many retail stores have websites where customers can make online purchases. These factors have resulted...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012